ABSTRACT

This paper explores two aspects of student
evaluations of college teaching: (1) a reformulation of end-of-term
ratings, by defining them in relation to initial student
expectations; and (2) a research design which studies
this reformulation in the natural setting of the classroom. An
expectations instrument, using semantic differential scales, was
administered at the beginning and end of a semester to 209 social 
science students at three colleges. Some findings were: (1) there
were statistically significant differences between expectations and
evaluations, even for those classes where the actual evaluations were 
quite high; (2) freshmen and sophomores at all three colleges 
exhibited similar expectations; (3) sophomores, rather than freshmen, 
consistently revealed higher evaluations; and (4) compared with 
sophomores, freshmen expectations appeared more in keeping with their 
evaluations. It was concluded that an end-of-course rating can be a
generally reliable indicator of student reaction, but an accurate
appraisal of the dynamics of the teaching-learning situation requires
input of teacher and student expectations. (RN) 
ABSTRACT

A Reassessment of College Students'
Instructional Expectations and Evaluations
Donald J. Calista 

Chairman, Behavioral and Social Sciences Department, Bennett College 

Questioning the conventional end-of-course ratings of professors, 
an evaluations model was introduced to extend the notion of ratings to 
incorporate measures of student classroom expectations. As a working 
framework for classroom role structure, the study employed Stogdill’s 
(1959) group achievement formulation to distinguish between those
expectations associated with institutional dynamics (defined as situational)
and those related to the classroom (defined as contextual). A test at the
beginning and a retest at the close of a semester were administered to 209
students in 8 social science classes at two two-year colleges and one
four-year college. Developed from previous Critical Incident and factor
analysis techniques, the 20-item instruments used modified Semantic
Differential scales. Testing the major hypothesis revealed statistically
significant differences between each class' expectations and evaluations,
even for high evaluations. Year in college was not a significant intervening
variable for either expectations or evaluations. Since the evaluations
demonstrated the direction of expectations, but failed to indicate their
intensity, the study concluded that extrapolations made solely from
evaluations were of questionable value in measuring the range of
student-teacher interaction. Redefining the expectations-evaluations framework
seemed appropriate and further research suggestions were offered.
If the literature on student ratings of college courses dates from
the 1920s (Costin et al., 1971), its impact on educational practice is
far from conclusive.* Despite the greater use of statistical refinements,
there is little, if any, agreement as to what the ratings actually
measure (Cohen and Brawer, 1969). Is it teaching or learning, or some
combination, subtle or otherwise, of the two that is judged? Often
researchers skirt any theoretical (even semantic) issues by not discussing
whether student ratings are in fact evaluations. It seems that
end-of-term ratings per se should now be extended to incorporate certain
goal-setting and monitoring mechanisms. Only in this way can the traditional
notion of ratings become a more inclusive concept of evaluations. This
paper offers two alternatives to the present unsettled state of student
evaluations of college teaching. First, it seeks to reformulate
end-of-term ratings by defining them in relation to initial student
expectations. Second, the paper presents a research design that employs this
reformulation.



Expectations-Evaluations Framework

If there exists substantial literature on college student expectations 
related to institutional dynamics, similar research directed toward college 
courses remains scant (Stern, 1970). Nonetheless, to differentiate between
them, those expectations related to institutional press, as explored by
Stern (1970:6-7), may be called situational and those expectations directed

*My special thanks for generous help in completing this study are due
to J. William Nystrum, Charles L. Palermo and Max Weiner.
toward the classroom, contextual. Their essential difference is that
situational expectations may be more diffuse and generalized, whereas
contextual expectations are related more to group objectives and role
expectations. This clarification seems necessary, as much of the research
controversy over the problem of student situational expectations stems from
the unclear distinction made between situational and contextual categories
which, at least for etiological purposes, must be identified and
considered separately (cf. Feldman and Newcomb, 1969, 1:78-80). To make that
distinction, contextual expectations may be considered as those role and
group objectives students generate in relation to college courses and
instructors. A further separation also needs to be struck with consensual
expectations as discussed by Newcomb (1962). The key difference between
consensual and contextual expectations is the former's open-endedness
or self-defining qualities, especially in relation to peer group
membership (Newcomb, 1962:469-475; cf. Gross et al., 1958:48-69).

Although there is considerable work on contextual expectations in 
small-group research (Cartwright and Zander, 1968; Berger et al., 1969), 
there is little understanding of how they relate to college classroom 
dynamics. The most suggestive formulation for this set of contextual 
expectations has been provided by Stogdill (1959) in his general work on 
group expectations. Stogdill defines expectation as a "readiness for
reinforcement" which "is a function of drive, the estimated probability
of occurrence of possible outcome, and the estimated desirability of
outcome" (1959:62). By drive he means "the level of tension reactivity
exhibited by an organism," which would become activated into readiness
for reinforcement, defined as "the extent to which an individual is
prepared or unprepared to experience, or reconciled to the prospect of
experiencing, a possible outcome." In turn, actual outcomes would be
mediated by an individual's "prediction, judgment, or guess relative to
the likelihood that a given event will occur," as well as by the
estimated desirability, which he defines as "an individual's judgment relative
to the satisfyingness of, need for, demand for, appropriateness of, or
unpleasantness of, a possible outcome" (1959:62). In the classroom,
these two estimates determine student expectations associated with
projected outcomes of teacher behaviors. Aware that some question might be
raised regarding the positive or negative valence of the estimates,
Stogdill adds that the "estimate of probability and estimate of
desirability are not opposite ends of the same continuum. . . ." Instead, these
estimates "interact to determine the level of expectations." And, most
relevant to the classroom, "this interaction is formulated in terms of
what is uncertain in the future as well as in relation to what has been
learned in the past" (1959:1123).

If this concept of expectations can be understood as an interaction
of probable and desirable estimates, the particular behaviors are
dependent upon the classroom role structure. Accordingly, Stogdill first
distinguishes the place of the formal structure of groups, making it
possible to determine the different expectations actually associated with
individual instructors by students. As such, Stogdill discriminates
between expectations "attached to position rather than the occupant of a
position." Since the classroom's formal structure prescribes the
professor's status and function, which "are defined for a position rather
than for any given occupant of the position," Stogdill emphasizes the
relevance of expectations related to role structure, and this is most
appropriate here (1959:12.'}). For, if the ideal professor studies (which
complement the instructor ratings research) have identified distinct
classes of role behavior states associated by students with instructional
expectations (Terry, 1969; Lewis, 1968), then examining the interplay
of these traits seems to be the next logical research step.

Consequently, expectations as outlined by Stogdill have been used in
this paper as the role-related set of probable and desirable estimates of
instructor behaviors for which students exhibit a readiness for reinforcement.
This formulation permits classroom role expectations, and subsequent
evaluations, to be understood as a function of both the normative aspects of a
classroom and individual instructor behavior characteristics.

The interaction of student estimates of desirable and probable
instructional behavior outcomes, in other words, not only determines evaluations,
as the ratings or ideal professor studies suggest, but it also activates
student expectations (cf. Berger et al., 1969). Simply, the concept of
expectations employed here includes measuring the readiness for projected
outcomes.

By contrast, it seems that the model adopted by typical end-of- 
course ratings studies actually could impede an understanding of student 
expectations in relation to their evaluations. By excluding measures of 
probability and desirability these studies neglect to incorporate the 
expectations that ordinarily precede evaluations. Nor can expectations 
be assumed to be inherent in evaluations, for this has not been empirically 
established at all. According to the definition of expectations used here,
for example, an affirmative or negative end-of-term rating in any
instructional behavior category could have exceptional meaning only if initial
expectation inputs are taken into account. In the ratings studies,
however, there seems to be no way to ascertain whether students expect a
teacher to be friendly, even though he, in fact, turns out to be quite
friendly. The student or class could be, nonetheless, expecting to
achieve more organized knowledge about subject matter, an outcome which the
students rate less affirmatively. The degree to which either sheds
light on actual student expectations may be limited. In the ratings
studies, and more so in practice, the tendency would be to total the two
results, perhaps with one equalizing or cancelling the other. Extrapolations
of student expectations made from their evaluations could be, in short,
of questionable value.




Research Design 

Even when two recent researchers attempted to recast the typical
end-of-course ratings design, both stuck to surveying college students about
their expectations at the completion of the course (Phillips, 1964; Twa,
1970). In keeping with the reformulation of the expectations-evaluations
framework now being proposed, this design was longitudinal and
multi-institutional. Following Gage (1961:17), it further proposed relating
"changes in student[s] -- the dependent variable -- to some measure of the
teacher's behavior or classroom experience." An expectations instrument,
later modified for evaluations, was administered at the start of a
semester to 209 students about equally divided into 8 introductory social
science classes in 3 colleges. Students were not informed that the
instrument would be retaken for evaluations purposes at the end of the
semester. The three colleges were chosen because each shares the explicit
institutional goal of stressing the teaching function. Alpha is a
well-established two-year college for women enrolling about 400 students; Beta
is a newer comprehensive community college enrolling about 2,500 students;
and Gamma is a 100-year-old co-educational four-year college enrolling
about 1,600 students. The colleges are located within a 100-mile radius
of New York City.

The instrument employed was developed principally from two different
analytical ratings techniques. One of the most widely used techniques
can be found in the University of Michigan studies (Isaacson, 1963;
McKeachie, 1963; McKeachie, 1969), which applied factor analysis to items
gathered from previously used ratings instruments. From such a pool of
145 items, these studies derived 34 significant items (Isaacson et al.,
1963:345). The other research technique identified items through the
Critical Incident method (Ryans, 1960:79-83). From structured and
unstructured procedures, Owen identified an inventory of critical incidents
students associate with college instructors' behaviors. His study further
demonstrated a high degree of intercorrelation between its Critical
Incident items and the Michigan ones (1967:140). Thus, the items for the
present study's expectations-evaluations instrument were selected both
from Owen's study and from Michigan's pool. In all, a total of 20 items
were used for the instrument. These items were, in turn, clustered within
the six Critical Incident instructional behavior categories (1967:47).*
Because both the items and the category clusters were identified from 
estimates of desirable and probable behaviors associated with instructional 
roles, they can serve as a working approximation of contextual expectations. 

By comparison, if the Critical Incident and factor analysis techniques
provided highly correlated instrument items, there is little consistency



*The six instructional behavior categories employed were: I-Presentation
of Material: Content, Structure, and Scope; II-Presentation of Material:
Student Participation; III-Presentation of Material: Instructor's Style;
IV-Teacher and Student Rapport: Class Interaction; V-Evaluation of Students;
and, VI-Requirements of Students.
in scaling techniques for end-of-course ratings research. Though variety
in scaling practice can be advantageous, there has been no substantial
attempt on the part of ratings researchers to integrate their work with
previous ones (Costin et al., 1971). For example, if one study adopted
a nine-point sentence completion technique with the opposite poles being
some variation of: "I found the textbook in this course to be: 1) Very
dull" to "9) Very interesting," (McClelland, 1970:89) another used
adjectives such as "probing" or "sympathetic" on an open-ended seven-point
scale (Turner et al., 1969:67). A need for more uniform scaling technique
seemed obvious.

One scaling technique already employed by two recent investigators,
Lewis (1968) and Roes (1969), was the Osgood Semantic Differential (1957).
In employing a seven-point scale, the semantic differential permitted
greater amplitude at the extremes, while incorporating a midpoint as well.
The substantive criticism raised concerning its scale had been the usual
one regarding the neutrality of this midpoint (Snider and Osgood, 1969).
Yet it appeared, from recent evidence, that this did not impair the
usefulness of the technique (Oetting, 1967). Despite their somewhat different
designs, the Lewis and Roes studies suggested promising applications of the
semantic differential to evaluations research. Lewis used the bipolar
adjectives, for example, to determine whether college professors were
perceived in uniformly stereotypic terms, and found that different
disciplines evoked significantly divergent images in students (1968:189).
Unlike these studies, however, the current study employed only one bipolar
pair of adjectives per instrument item. For instance, "What do you expect
the instructor's knowledge of the subject to be?"

Superior : : : : : : : Inferior

In addition, in order to determine the appropriateness of the
expectations-evaluations instrument, the last item of each recorded student
appraisal of the instrument. This measure, usually absent from ratings
instruments, read: How do you rate the ability of this survey to determine
your own expectations (evaluations) of this course?

Strong : : : : : : : Weak

Though this was certainly not an internal control, it would still provide
some indication of the effectiveness of the Semantic Differential.

Hypotheses Considered

The major hypothesis would naturally compare the expectations with the
subsequent evaluations. Stated in the null form: One, within any class
there would be no statistically significant difference between student
expectations in the six Critical Incident categories and the subsequent student
evaluations. Because year in college is so prominent in research related
to institutional press (situational expectations), three hypotheses were
raised regarding college year. Two, freshmen would not exhibit statistically
significantly higher expectations compared with other classmen in any one of
the six Critical Incident categories; three, freshmen would not exhibit
statistically significantly higher evaluations compared with other classmen
in any one of the six Critical Incident categories; and, four, there would
be no statistically significant difference between freshmen expectations
and their evaluations compared with other classmen in any one of the six
Critical Incident categories.

The question of year in college relates specifically to the notion of
the "Freshman Myth" (Stern, 1970:17b). In his studies dealing with
situational expectations, he found that freshmen "look forward to high levels
of activities relevant to both academic and non-academic press. . . .";
"this does not correspond to the actual characteristics of [their] schools
at all" (1970:02). Even though these studies included general questions
about faculty instruction and courses, how close the resulting conclusions
would be when contextual expectations were differentiated from the
situational ones remained an open question. Hence, the three hypotheses
involving year in college.

Findings and Discussion

Since the data will be presented in terms of the Critical Incident
instructional categories, they will be abbreviated as follows: I: Content;
II: Participation; III: Style; IV: Rapport; V: Evaluation; and VI:
Requirements. It should also be mentioned that Alpha's classes are cited
as A, B, C and D; Beta's as E and F; and Gamma's as G and H. For each of
the hypotheses, two-tailed tests were used because the null form did not
include a prediction of direction. Statistical significance was minimally
accepted at the .05 level of judgment.

The first (and major) hypothesis considered the changes in each sample
class. To test the null hypothesis of no differences between expectations
and evaluations for each Critical Incident category, group means were
compared using the t test between correlated groups (Mueller et al., 1970:417).
In each case degrees of freedom were determined by the number of test pairs
minus one. The relevant data can be found in Tables 1, 2 and 3. In Beta
and Gamma colleges significant differences were found in virtually all of
the categories in each of the classes and college totals. Most of the
differences were at the .05 level. Alpha college classes reported some
significant differences in three of the four classes. The college total
produced significant differences in four of six Critical Incident
categories. In all, 30 out of 48 expectations-evaluations Critical
Incident comparisons reached acceptable levels of statistical significance.
In only 4 instances in classes A and B did the mean differences for the 
evaluations outweigh the expectations; of these, one instance produced a 
significant difference in the Participation category of class A. Thus, 
the null hypothesis was rejected for the alternative that differences 
were to be found between student contextual expectations and their 
evaluations. 
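
The correlated-groups comparison described above can be sketched in code. The following is a minimal illustration, assuming each student contributes one expectation score and one evaluation score on the same Critical Incident category. The function name `paired_t` and the sample scores are hypothetical and are not data from this study. The sketch computes the t statistic for correlated means, with degrees of freedom equal to the number of test pairs minus one.

```python
import math

def paired_t(expectations, evaluations):
    """Return (t, df) for a correlated-means t test on paired scores."""
    if len(expectations) != len(evaluations):
        raise ValueError("each expectation needs a matching evaluation")
    n = len(expectations)
    if n < 2:
        raise ValueError("at least two pairs are required")
    # Difference score for each student: evaluation minus expectation.
    diffs = [ev - ex for ex, ev in zip(expectations, evaluations)]
    mean_diff = sum(diffs) / n
    # Unbiased sample variance of the difference scores.
    var_diff = sum((d - mean_diff) ** 2 for d in diffs) / (n - 1)
    if var_diff == 0:
        raise ValueError("difference scores have zero variance")
    t = mean_diff / math.sqrt(var_diff / n)
    return t, n - 1  # df = number of test pairs minus one

# Hypothetical seven-point ratings for four students (illustration only).
t, df = paired_t([1, 2, 3, 4], [2, 4, 5, 4])
print(round(t, 3), df)
```

The resulting t would then be compared with the two-tailed critical value at the chosen level (.05 in this study) for the given degrees of freedom.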

An analysis of the ranks for the expectations and evaluations
categories confirmed the alternative hypothesis, too. This analysis might
help answer questions raised regarding the reliability of the study
population. Data reported in Table 4 indicated the rank order (rho)
computations for the Critical Incident expectations scales in each of the
colleges. The rank differences between the colleges were negligible. 
Furthermore, the relative constancy of the students' expectations can be 
seen by comparing the rank order (rho) differences of their expectations 
and their subsequent evaluations. Table 5 reported this data. Despite 
the varying group mean differences actually recorded by each class, the 
students ranked the six Critical Incident categories in a rather con- 
sistent fashion. This seemed remarkable even if one were faced with the 
fact that the students' expectations were met with uneven outcomes. The
rho values for the Critical Incident categories were .93 for Alpha college,
.76 for Beta, and .94 for Gamma. Certainly this would serve as a
confirmation of the substantive hypothesis that expectations influence the way
students record their evaluations of instructors. 
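
The rank order comparison can be illustrated with a short sketch. It assumes that the rho reported here is Spearman's rank-order coefficient and that the six categories are ranked without ties, in which case rho = 1 - 6(sum of squared rank differences) / (n(n^2 - 1)). The function name `spearman_rho` and the two rankings below are hypothetical and are not taken from Tables 4 or 5.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank-order correlation for two rankings without ties."""
    if len(ranks_a) != len(ranks_b):
        raise ValueError("both rankings must cover the same categories")
    n = len(ranks_a)
    if n < 2:
        raise ValueError("at least two ranked categories are required")
    # Sum of squared differences between the paired ranks.
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - (6 * sum_d2) / (n * (n ** 2 - 1))

# Hypothetical ranks of the six categories (illustration only).
expectation_ranks = [1, 2, 3, 4, 5, 6]
evaluation_ranks = [2, 1, 3, 4, 6, 5]
print(round(spearman_rho(expectation_ranks, evaluation_ranks), 3))
```

A value near 1 indicates that the categories were ordered in nearly the same way on both occasions, even when the group means themselves differ.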



This conclusion seemed to be supported by noting student reactions
to the expectations and evaluations instruments. This data, gathered
from responses regarding the efficacy of the instruments, can be found in
Table 6 by sex, with a test for correlated means computed for differences.
The only difference actually noted did indicate that the evaluations
instrument proved to be a better indicator of student responses. Again,
despite the significant differences between actual expectations and
evaluations, student response to the study instruments remained relatively
constant.

To test the second hypothesis that there would be no difference
between freshmen and other classmen, group means for each Critical
Incident expectations category were computed and reported in Table 7.
Because there were so few third and fourth year students in each
introductory social science course, the data were grouped by freshmen and
sophomores. The t test for uncorrelated (or independent) means (Mueller
et al., 1970:407) was used for determining statistically significant
differences. Degrees of freedom were determined by summing the two test
samples and subtracting two. Since no differences were noted, the null
hypothesis was therefore accepted, which, in this case, served as the
substantive one. Clearly, the freshmen and sophomores reported similar
contextual expectations. Even if one discounts the existence of an
"upper-classmen" effect for the two-year college students, no differences
were reported by the four-year college sophomores.
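
The independent-means comparison can be sketched in the same way. The text gives only the degrees of freedom (the two sample sizes summed, minus two), so the pooled-variance form of the t test used below is an assumption consistent with that rule. The function name `independent_t` and the sample scores are hypothetical and are not data from Table 7.

```python
import math

def independent_t(group_a, group_b):
    """Return (t, df) for a pooled-variance t test on two independent groups."""
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two scores")
    mean_a = sum(group_a) / n_a
    mean_b = sum(group_b) / n_b
    # Sums of squared deviations around each group's own mean.
    ss_a = sum((x - mean_a) ** 2 for x in group_a)
    ss_b = sum((x - mean_b) ** 2 for x in group_b)
    df = n_a + n_b - 2  # two sample sizes summed, minus two
    pooled_var = (ss_a + ss_b) / df
    if pooled_var == 0:
        raise ValueError("scores have zero variance")
    std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / std_err, df

# Hypothetical freshman and sophomore category scores (illustration only).
t, df = independent_t([1, 2, 3], [2, 4, 6])
print(round(t, 3), df)
```

As with the correlated test, the statistic would be judged against a two-tailed critical value at the .05 level.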

The third hypothesis that there would be no differences in freshman
and sophomore evaluations was similarly tested and cited in Table 8. As
can be readily seen, the only differences noted were for sophomores, and
these were significantly greater. Although sophomore evaluations were
significantly higher in these nine instances, in all but one, their
evaluations remained higher than those of freshmen. Here again, the null
hypothesis was accepted, for, when differences appeared, sophomore
evaluations (not freshman) produced them.

TABLE 6. Means and Standard Deviations of Student Responses to Part II, Item

Hypothesis four tested the proposition that there would be no
significant differences between freshmen expectations and evaluations as
compared with sophomores. The group means for this comparison by college
can be found in Table 9. Here the t test for correlated means was employed.
Since there were significant differences to the .001 level for both
freshman and sophomore evaluations, the null hypothesis became untenable. This
would confirm the substantive hypothesis of this study: students record
their classroom experiences in similar ways. Taking Beta college, for
example, it did not make much difference whether freshmen had lower
expectations than sophomores, for the actual experiences produced significantly
lower evaluations for both groups. To sum up the three hypotheses regarding
college year, both freshmen and sophomores recorded expectations and
evaluations that seemed to relate to each other's reality rather than their
year in college as such.

Conclusions 

As a study of student contextual expectations in relation to their
evaluations in the natural setting of the classroom, the findings
supported the substantive hypotheses, namely, that end-of-course ratings
offered only a general understanding of expectations. This was affirmed
in the analyses of each class and college group means as well as rank
order comparisons for the Critical Incident expectations and evaluations
categories. More importantly, the findings demonstrated that statistically
significant differences emerged between expectations and evaluations, even
for those classes where the actual evaluations would probably qualify as
being reasonably high. In such instances, as for classes A and C in
Table 1, significant differences occurred in half of the categories. If
evaluations were the only measure for those classes in Tables 2 and 3,
the lower expectations, in comparison with still lower (and significant)
evaluations, would surely escape notice. So, despite the fact that the
students in these social science classes exhibited similar directions for
contextual expectations and evaluations, the actual intensity of these
factors varied. In other words, it now seemed safer to say that
extrapolations from evaluations regarding the actual nature of student-teacher
classroom interaction could be of questionable value.

The dichotomy between expectations and evaluations appeared more
prominent concerning year in college. Here the data substantially
indicated that freshmen and sophomores exhibited similar contextual
expectations in all three colleges. Regarding evaluations, however, it was the
sophomores, not the freshmen, who consistently revealed higher evaluations
and, in some cases, statistically significant ones, as reported in Table 9.
This did certainly question the applicability of relating the findings
from situational (or generalized) expectations to classroom contexts
(Wallace, 1966; Stern, 1970). If a "Freshman Myth" existed for the
current study's groupings, then it applied to sophomores, too. Somewhat of
an inversion took place: compared with sophomores, freshmen expectations
appeared more in keeping with their evaluations. As proposed in the
conceptual framework outlined earlier, these findings support the need for
differentiating between the operation of sets of college-related
expectations. In this regard, it might be noted that even in Feldman and
Newcomb's comprehensive study, no attempt was made to distinguish between
differing sets of student expectations (1969, 1:78-82), such as
situational and contextual ones.

Thus, as part of any future research agenda, it would be important
to learn how students report each set of expectations. Are the
differences to be found in the perception of each set? To be sure, the findings
of this study, in keeping with Stogdill's framework, indicated that the
role and performance aspects of a classroom would clearly account for
some of the possible differences in reporting contextual expectations.
In sum, the dynamics of classroom participation compared with
institutional membership have still to be clarified.

What implications did this study reveal for currently accepted
evaluations? If evaluations simply meant a reaction to the instructional
process, then the end-of-course ratings might still serve a valuable
function, for, as this study confirmed, in keeping with previous research,
an end-of-course rating can be a generally reliable indicator of student
reaction. But, if testing student-teacher interaction becomes desirable,
then the appropriateness of expectation inputs would be obvious. Clearly,
presently conceived end-of-course ratings have built-in limitations. In
addition, these ratings could create the false impression that the students
in any one class remain inert, especially if an instructor received similar
ratings over time. Certainly for this study's groups, the expectations
dimension would have added to an understanding of teacher-student
interaction, or lack of it.

The problem of classroom interaction raised a question concerning the
contextual expectations model. If this approach inherently called for
greater participation by student and teacher in defining classroom
objectives, its corollary would be the increased awareness of each other's
expectations. Hence, it might be argued that in order to establish the
credibility of this model, the study had to demonstrate the obvious. That
is, student expectations would be different from their evaluations. If
this was so, it resulted from the fact that end-of-course ratings
characteristically failed to include the intensity of student expectations.
Obvious or not, the findings of this study would discourage any evaluations
cast in isolation from expectations.

Recommendations 

If this study was limited by the in-group homogeneity of the student 
samples, it might be appropriate to replicate the design in other multi- 
institutional settings with a cross-section of students. Since most end- 
of-course ratings research has taken place in state colleges and univer- 
sities, it would be advantageous to introduce the expectations dimension 
into those settings, too. If single institutions must be used, then the 
sample should be heterogeneous. Additional work in the area of
instrumentation seemed appropriate. Although this study suggested Osgood's
Semantic Differential technique to overcome the increased proliferation
of instruments, perhaps other scales (some already in use) could be
correlated with it. The thrust of any future refinements, however, should
focus on validating expectations-evaluations instruments.

Improvements in the study design could be suggested, too. One would
be to introduce certain controls into the design itself. For example, it
could now prove worthwhile to manipulate the classroom situation (the
independent variable) by introducing the concept of contextual expectations
into some groups only. Although many factors would require control, it
would be important to learn the effects of this approach in actual
classrooms. This type of experiment could be conducted in larger institutions
with relative ease. In small colleges further controls would have to be
introduced through a multi-institutional design.
Further, studies might consider the intervening variables of class
size and differing subject areas in relation to the expectations-evaluations
approach suggested here. Longer-range longitudinal studies would
be needed to determine the impact of actually employing this approach on 
both students and teachers. Similarly, comparisons between the more in- 
novative curricula and colleges with the more conventional approaches 
would certainly help determine whether contextual expectations could 
become an acceptable classroom model. Studies of this type would also 
help evaluate the systems approaches now being suggested for higher edu- 
cation (Axelrod, 1969). 

By way of conclusion, it might be said that despite the rapidly
changing college scene, the research scene might still lack reliable
knowledge regarding its sine qua non: the learning scene. The situation
remained not too different from an observation of ten years ago: ". . . a
straight-forward attack on the area of student-instructor role dynamics
in the college cultural setting is yet to be made" (Dick, 1961:415).
Fortunately, a corrective was recently issued calling for a multi-dis- 
ciplinary field to study the parameters of classroom dynamics, appropriately 
named the sociology of learning (Boocock, 1966:1). If its limits were 
still somewhat undefined, examining the interplay of contextual and situ- 
ational expectations with related sociological variables in the college 
setting should be part of its research schedule. The concept of a sociology 
of learning would thereby give needed direction and shape to classroom 
evaluations, including the contextual expectations dimension. 